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Abstract — The downlink of a multiple-input multiple output 
(MIMO) broadcast channel (BC) is considered, where each 
receiver is equipped with a single antenna and the transmitter 
performs nonlinear Dirty-Paper Coding (DPC). We present an 
efficient algorithm that finds the optimum transmit filters and 
power allocation as well as the optimum precoding order(s) 
possibly affording time-sharing between individual DPC orders. 
Subsequently necessary and sufficient conditions for the optimal- 
ity of an arbitrary precoding order are derived. Based on these we 
propose a suboptimal algorithm showing excellent performance 
and having low complexity. 

I. Introduction 

We study the multi-antenna downlink where each receiver 
has only a single antenna. From an information theoretic 
perspective, the channel can be modeled as a multiple-input- 
single-output (MISO) Gaussian broadcast channel (BC). As- 
suming that the transmitter has knowledge of the channel 
and can perform Dirty-Paper Coding, we are interested in 
the optimum transmit filters, power allocation as well as the 
optimum precoding order to achieve a certain set of SINR 
targets or equivalently a set of target rates specified in ad- 
vance. This perspective reflects the recent need for fairness in 
mobile communication systems and conflicts with throughput 
maximization. 

The problem of sum power minimization has been well 
studied in the beamforming literature and relying on Perron- 
Frobenius theory it was solved for linear processing (see e.g. 
[1], [2] and references therein). Further it was generalized to 
the case of nonlinear DPC beamforming in [3]. In contrast 
to the case of purely linear beamforming the system is not 
interference limited for DPC. However, a drawback of the 
mentioned approach is that it solves the problem for a fixed 
DPC order. An inherent problem is that for every DPC order 
the interference coupling matrix has a different structure. So 
no method is known to jointly optimize the DPC order with 
the power and transmit filter optimization. Thus finding the 
optimum DPC order remains a combinatorial problem being 
prohibitive even for moderate numbers of users. 

In order to overcome these problems a generic approach 
stemming from an information-theoretic perspective con be 
chosen considering rate constraints instead of SINR constraints 
(in principle, any QoS-measure related to rate via a bijective 
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mapping can be considered). This allows to use recent results 
on duality of MIMO BC and multiple access channel (MAC) 
[4], [5] and sum power minimization for MIMO BC [6]- 
[9]. Using the duality of MISO BC and single-input-multiple- 
output (SIMO) MAC the problem was considered for a fixed 
precoding order in [9] and the same authors showed recently 
that the precoding order can indeed by optimized using a 
sequence of convex programs [10]. 

In this paper we extend the results from [10]. We present 
an efficient algorithm yielding the optimal power allocation, 
transmit filters and precoding order exploiting ideas from [11] 
and [12]. In contrast to the algorithm proposed [10] there is 
no need to combine two nested loops of convex optimization 
problems. We study the cases where a given SINR target is 
achievable with minimum power using time-sharing between 
different precoding orders pointing out that the information- 
theoretic approach can be considered as a relaxation. Further 
we derive necessary and sufficient condition for the optimality 
of a given DPC. These are easy to check since they rely 
on a simple recursive equation. Based on these conditions 
we propose a low complexity algorithm showing very good 
performance. 

The remainder of this paper is organized as follows. Section 
II briefly describes the system model and Section III contains 
the problem statement. Subsequently, the problem is solved 
using a relaxation approach in Section IV. Conditions on 
the optimality of a DPC order are derived and a suboptimal 
algorithm is presented in Section V. Numerical results are 
presented in Section VI and we conclude with Section VII. 

A. Notation 

Sets are given by calligraphic letters, lower case bold letters 
represent vectors and upper case bold letters denote matrices. 
All vectors are column vectors and | • | is the determinant. 
A complex variable c = a + jb is said to be circular 
symmetric complex Gaussian distributed c ~ C/V(0, 1) if its 
real and imaginary part are independently distributed with 
a ~ J\f(0, 1 /2) and b ~ Af(0, 1/2). 

II. System Model 

Consider a MISO downlink, where the transmitter has tit 
antennas and each of the M receivers is equipped with a single 
antenna. Then the signal received by user m can be written as 

r m = h^t + z m (1) 



where h m G 



••tit x 1 



is the channel matrix to user m, t G 



C nrXl is the transmitted symbol vector and z m ~ CJ\f(0, 1) 
is circular symmetric additive white Gaussian noise (AWGN). 
Assuming that the transmitter has perfect channel knowledge, 
DPC can be applied. Let tt(-) be a DPC encoding order such 
that user ir(M) is encoded first, followed by user ir(M — 1) 
and so on. Then the interference caused by users which have 
been already encoded (n > m) can be taken into account 
while encoding the signal of user rn rendering it harmless. 
The dual SIMO MAC is described by the system equation 



y = 



M 

E 



n 



(2) 



where y G C" TXl is the received signal, x m is the symbol 
transmitted by user m and n G C" TXl is again circular 
symmetric AWGN. Assume that the receiver can perform Suc- 
cessive Interference Cancellation (SIC). Choosing a decoding 
order 7r(-) such that user 7r(l) is decoded first followed by user 
7r(2) and so on, the signals of already decoded users n < m 
can be subtracted such that they do not cause any interference 
while decoding user mth signal. 

Recent uplink-downlink duality results state that the capac- 
ity regions of MIMO BC and its dual MIMO MAC under a 
common sum power constraint coincide [5] and each rate tuple 
achievable in the MIMO BC under a certain DPC order 7r(-) 
can be achieved in the MIMO MAC applying the reverse SIC 
order [4]. Exploiting these facts all problems can be solved 
equivalently in the dual channel. Since the SIMO MAC has 
a much more favorable (polymatroid) structure, we will focus 
on the uplink to solve the stated problems. 

III. Problem Statement 

The problem to be studied can be formulated in a signal- 
processing context as well as in an information theoretic 
context. We begin with the signal processing formulation. To 
this end consider the BC system model in (1) and decompose 
the transmitted signal to 



t = Us 



where U = [ui, Um] G C nTX is a matrix of spatial 
beamformers and s G C Mxl is the vector of independent 
transmit symbols. Denoting the transmit power used for the 
symbol of user m with p m = E{|s m | 2 }, the problem can be 
written 

M 

min y^Vm 

u,p^ ^ (4) 

m— 1 

subj.to 7 Tr(m ) >%{ m ) Vrn 

where U = [ui, um] t is a matrix consisting of beamform- 
ers, 7r(-) is a DPC order such that user ir(M) is encoded first 
followed by user tt(M — 1), j m is the SINR target for user 
m and 

'l U f(m) h ^Ml| 2 



i+e: 
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Fig. 1 . Exemplary MISO BC (SIMO MAC) capacity region for random 2 
user channel and 10 dB transmit SNR. Sum capacity polymatroid highlighted 
with rate (black marker) tuple not achievable with a single DPC order. 



This problem has been studied extensively in the signal- 
processing community (see e.g. [3] and references therein) 
and can be solved for any fixed precoding order ir(-) using 
Perron-root techniques and SINR uplink-downlink duality . 

On the other hand (4) can be reformulated in an information- 
theoretic context if Gaussian inputs are assumed. Introducing 
covariance matrices Q m = E{t m t^} with 5Z m=1 t m = t 
and relating SINR to rate using the bijective mapping 



R m = log(l + SINR J; 



yields 



mm 
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subj.to i? 7r(m ) > Rx( m ) Vm 
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(3) where 
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7r(m) 



log- 
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1 + En=m+1 ' 1 f(ti)^»(»)' lT 



(8) 



(») 



and the rate constraints are given by R m = log(l + 7 m ). 

In order to make (7) tractable continuous relaxation can be 
used allowing convex combinations of different DPC orders 
(time-sharing). Combined with duality relations from [4] and 
introducing uplink powers p — \p\, ..,pm] (7) can be refor- 
mulated as a convex program in the uplink [10]: 



M 

min 2_j Pn 

m— 1 



subj.to log 
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p m h m h. 
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VSC {!,..., M}. 



(9) 



Algorithm 1 Minimum Sum Power for Beamforming 

(0) initialize A (0) = [XY iax /2, X^ ax /2] 
while desired accuracy not reached do 
(1) for given A*-"^ solve 



M 



12 P« 



R (n) =argmax X ™ )r 

m—1 m 

1 + J2n= m P^(n)K(n)ii" (n) 



subj.to R n i m ) < log 



(10) 

with DPC order n(-) such that 

\( n ) <? <r \( n ) 

^ •■• - \(M) 

(2) determine a subgradient v £ according to 

v m =Rg>-Rm Vm (11) 

(3) determine new ellipse with new centroid \( n+1 '> 
according to standard ellipsoid algorithm 

end while 



The set of constraints corresponding to all partial sums defines 
the SIMO MAC capacity region. It is important to note that 
(9) is a convex relaxation of the original problems (4) and 
(7) including time-sharing solutions. This is illustrated Fig. 1, 
where an exemplary MISO capacity region is depicted. The 
marked rate tuple is not achievable with a single DPC order 
at 10 dB although it is a solution to (9). 

IV. A Convex Relaxation Approach 

The formulation (9) including all rate tuples within the ca- 
pacity region constitutes the classical sum power minimization 
problem. It was studied for OFDM (a diagonal MIMO system) 
in [13] and solved for the general MIMO case in [6], [7], [14]. 
In principle, (9) is a convex problem and can be solved using 
standard solvers. However, the number of constraints grows 
exponentially with M making an optimization quite complex. 

In [10] an algorithm comprising two nested loops of convex 
problems was proposed. The complexity can be considerably 
reduced using an optimization of the dual function 



,g(A) = inf £(p,A) 

p:p m >0 



(12) 



where A = [Ai, \m] T denotes Lagrangian multipliers 
corresponding to the rate constraints and the Lagrangian is 
given by 



A I 



£(p, A) = ^ Pm - X! ^n{R m - Rm) 



m—1 



and R m is defined in (8). Since the dual (12) is not necessarily 
differentiable, it can be optimized using any optimization 
method relying on subgradients such as the ellipsoid method. 
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Fig. 2. Exemplary convergence of Algorithm 1 for a channel with ut = 3 
and M = 3 users. 



Further there exists an upper bound on the dual parameters 
A and thus an initial ellipse can be found (see [7]) 1 . An 
important observation is that the optimization in (10) can be 
done very efficiently with a modified version of the algorithm 
presented in [11]. Although developed for a fixed power 
budget, it can be modified to solve the problem for fixed A 
due to a simple monotonicity property (details omitted due 
to the lack of space). The convergence process is illustrated 
exemplarily in Fig. 3. 

V. Fixed DPC Order and Suboptimal Algorithm 
A. Fixed ordering 

Assume an arbitrary DPC order 7r(-). For this case the 



original problem (7) simplifies to 



M 
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In contrast to (7), there exists a closed-form solution having 
a recursive structure [9]: 



P-n{va) 
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h f(m) ( r + EZm+1 PA*)^Mi)K(i) 



(14) 



B. Conditions for optimality of tt{-) 

A question of considerable interest is whether a given 
precoding order 7r( ) is optimal or not. 

1 For details concerning the ellipsoid method the interested reader is referred 
to [15]. 



To this end consider the global problem (9). If a precoding 
order n(-) and the corresponding solution p = [pi, ...,Pm] T is 
optimal, the KKT conditions of (9) have to be fulfilled since 
(9) is a convex program. 

First note that the number of active constraints in (9) is 
upper bounded by M due to the polymatroid structure. To be 
more precise define the set 

S K = {S C M : \S\ = K, K < M} 

where | • | is the cardinality of a set. Then the number of active 
constraints S^ ct C S K is at most one: 

|5^| G {0,1} VK 
Time-sharing is needed to achieve a certain rate tuple if 



3 K : = 0, 



(15) 



i.e. for at least one K there is no constraint active. 

In the following we assume that the minimum sum power 
is attained without time-sharing. Then the following Lemma 
states necessary and sufficient conditions on the optimality of 
a DPC order: 

Lemma 1: Let p = [pi, ...,Pm] T be the solution to (13) 
with DPC order 7r(-). Then ir(-) and p are the optimal 
precoding order and power allocation if and only if 



A m — A m _i > 
for all m = 2, M where 



\n — 



and 
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Proof: Consider the Lagrangian of (13): 



(16) 
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with Z m defined in (18). Due to the non-convexity of (13) the 
KKT conditions are necessary but not sufficient. The derivative 
w.r.t. p^ [m) is given by 

dC(p,\,v) 



dp^im) 



= =1 



/ m—l 



T ( m ) 



(m) 



A "i I1 7r(m) Z/ m n Tr(m)- 



Solving for A TO and setting v m = (since p m > 0) yields 
the recursive expression in (17). On the other if 7r(-) is the 
optimal DPC order, the KKT conditions of the convex global 
problem (9) must be fulfilled and exactly the M constraints 
corresponding to the sets 

S m = {tt(1), ...,7t(to)} m = l,...,M 

must be active. A simple expansion of the Lagrangian given 
in (19) leads to 

M 



£(p, A > v ) = Pn 

m—l 
/ M 

+Ai I ^ -Rw(n) _ l°g \Z\\ 



\n=l 



M 



M 



m=2 
M 



+ Y VmVn 



m—l 



which is exactly the Lagrangian for the global problem if 7r(-) 
is the optimal DPC order and where the Lagrangian multipliers 
are given by 



Or, 



X m m = 1 

A TO - A TO _i m = 2,...,M 



The KKT conditions of (9) are fulfilled with all constraints 
specified in (20) being active if and only if 

9 m > for all m 

leading to (16). ■ 
Corollary 1: A DPC order is part of the time-sharing solu- 
tion if 

A m -i ~ A m > 
for all m = 2, M and 

3 m : X m -i - A TO = 
for at least one m. This corresponds to (15). 

C. A low complexity approximation 

In order to develop a low complexity algorithm we exploit 
the fact that there exists an analytic solution for a fixed DPC 
order and the optimality condition in (16) is very easy to check 
due to their simple recursive structure. 

An idea based on the optimality conditions is to resort the 
users according to their Lagrangian multipliers, since they 
reveal the current rate price. Since a user encoded earlier sees 
less interference he needs less power to support a given rate. 
Thus it makes sense to encode users with a high value of 
A TO earlier. Motivated by this argument, Algorithm 2 sorts the 
users according to their Lagrangian multipliers. By doing so, 
different vertices are visited. If the optimality condition in (16) 
is fulfilled or a vertex is revisited the algorithm stops. 



Algorithm 2 Suboptimal Iterative Algorithm 

(0) fix an arbitrary precoding order 7P n J 
while not converged do 

(1) for given n^ n ' find p^™)) by solving minimum sum 
power problem in (13) 

(2) determine new precoding order 7r(" +1 ) such that 



A 



(„) 

7r(«+i)(l) 



< 



< A 



(n) 

7rC"+D(M) 



(20) 



(n) 

where Am is given by (17). 
end while 
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Fig. 3. Rate requirement [bps/Hz] over average transmit SNR [dB] for 
riT = 3 transmit antennas and M = 3 users. 



VI. Simulations 

Figures 3 and 4 illustrate the average transmit power needed 
to support an equal rate for each of three users, where the 
transmitter has nx — 3 antennas. Each channel realization 
was generated according to an iid Rayleigh fading process. 
It can be clearly observed that the exhaustive search over 
all DPC orders does not achieve the curve based on convex 
optimization. This is due to the cases involving time-sharing. 
The suboptimal algorithm performs surprisingly well despite 
its low complexity. All three outperform the randomly chose 
DPC order. Simulations suggest that this gap grows as the 
number of users increases. 

VII. Conclusions 

We studied the problem of finding the optimal resource 
allocation and DPC order for nonlinear downlink beamform- 
ing, if SINR or rate constraints have to be met. An efficient 
algorithm based on convex relaxation was presented, which 
may yield time-sharing solutions. Moreover conditions for the 
optimality of a specific DPC order were derived and based 
on these we proposed a low complexity algorithm showing a 
minimal performance loss. 
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Fig. 4. Enlargement of region marked in Fig. 3. 
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